Benchmarking infrastructure for mutation text mining
نویسندگان
چکیده
منابع مشابه
Benchmarking infrastructure for mutation text mining
BACKGROUND Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. RESULTS We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutatio...
متن کاملText Mining Infrastructure in R
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis metho...
متن کاملBenchmarking Infrastructure for Big Data
This paper discusses the applicability of 20 years of experience benchmarking transactional systems (TPC) and storage system benchmarking (SPC) to the design of systems suitable for use in comparing different analytic systems. One of the most challenging problems is finding tools to make meaningful comparisons between substantially different architectures working on solving similar problems. Fo...
متن کاملnala: text mining natural language mutation mentions
Motivation The extraction of sequence variants from the literature remains an important task. Existing methods primarily target standard (ST) mutation mentions (e.g. 'E6V'), leaving relevant mentions natural language (NL) largely untapped (e.g. 'glutamic acid was substituted by valine at residue 6'). Results We introduced three new corpora suggesting named-entity recognition (NER) to be more ...
متن کاملDiMeX: A Text Mining System for Mutation-Disease Association Extraction
The number of published articles describing associations between mutations and diseases is increasing at a fast pace. There is a pressing need to gather such mutation-disease associations into public knowledge bases, but manual curation slows down the growth of such databases. We have addressed this problem by developing a text-mining system (DiMeX) to extract mutation to disease associations f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Biomedical Semantics
سال: 2014
ISSN: 2041-1480
DOI: 10.1186/2041-1480-5-11